A Weakly Supervised Learning-Based Oversampling Framework for Class-Imbalanced Fault Diagnosis

نویسندگان

چکیده

With the lack of failure data, class imbalance has become a common challenge in fault diagnosis industrial systems. The oversampling methods can tackle class-imbalanced problem by generating minority samples to balance training set. However, one main challenges existing is how generate high-quality samples. Traditional regard all synthetic as ones be added set without filtering. low-quality would distort distribution dataset and worsen classification performance. In this article, we propose weakly supervised method that treats unlabeled develops graph semisupervised learning algorithm select samples, adding into final To improve quality cost-sensitive neighborhood component analysis dimensionality reduction enhance domain information validity high-dimensional datasets. Finally, combining boosting-based ensemble framework, new imbalanced framework suitable for high highly experimental validation performed on five real-world wind turbine blade cracking datasets compared 15 benchmark methods. results show average performances robustness proposed are significantly better than those

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Oversampling for Imbalanced Learning Based on K-Means and SMOTE

Learning from class-imbalanced data continues to be a common and challenging problem in supervised learning as standard classification algorithms are designed to handle balanced class distributions. While different strategies exist to tackle this problem, methods which generate artificial data to achieve a balanced class distribution are more versatile than modifications to the classification a...

متن کامل

Oversampling Method for Imbalanced Classification

Classification problem for imbalanced datasets is pervasive in a lot of data mining domains. Imbalanced classification has been a hot topic in the academic community. From data level to algorithm level, a lot of solutions have been proposed to tackle the problems resulted from imbalanced datasets. SMOTE is the most popular data-level method and a lot of derivations based on it are developed to ...

متن کامل

A Synthetic Minority Oversampling Method Based on Local Densities in Low-Dimensional Space for Imbalanced Learning

Imbalanced class distribution is a challenging problem in many real-life classification problems. Existing synthetic oversampling do suffer from the curse of dimensionality because they rely heavily on Euclidean distance. This paper proposed a new method, called Minority Oversampling Technique based on Local Densities in Low-Dimensional Space (or MOT2LD in short). MOT2LD first maps each trainin...

متن کامل

Imbalanced Multiple Noisy Labeling for Supervised Learning

When labeling objects via Internet-based outsourcing systems, the labelers may have bias, because they lack expertise, dedication and personal preference. These reasons cause Imbalanced Multiple Noisy Labeling. To deal with the imbalance labeling issue, we propose an agnostic algorithm PLAT (Positive LAbel frequency Threshold) which does not need any information about quality of labelers and un...

متن کامل

Semi-Supervised Learning for Imbalanced Sentiment Classification

Various semi-supervised learning methods have been proposed recently to solve the long-standing shortage problem of manually labeled data in sentiment classification. However, most existing studies assume the balance between negative and positive samples in both the labeled and unlabeled data, which may not be true in reality. In this paper, we investigate a more common case of semi-supervised ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Reliability

سال: 2022

ISSN: ['1558-1721', '0018-9529']

DOI: https://doi.org/10.1109/tr.2021.3138448